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Radio source stacking and the infrared / radio correlation at 
microjy flux densities 
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ABSTRACT 

We investigate the infrared / radio correlation using the technique of source stacking, in order 
to probe the average properties of radio sources that are too faint to be detected individually. 
We compare the two methods used in the literature to stack sources, and demonstrate that the 
creation of stacked images leads to a loss of information. We stack infrared sources in the 
Spitzer extragalactic First Look Survey (xFLS) field, and the three northern Spitzer Wide-area 
Infrared Extragalactic survey (SWIRE) fields, using radio surveys created at 610 MHz and 
1 .4 GHz, and find a variation in the absolute strength of the correlation between the xFLS and 
SWIRE regions, but no evidence for significant evolution in the correlation over the 24-|xm 
flux density range 150 \iiy - 2 mJy. We carry out the first radio source stacking experiment 
using 70-tim-selected galaxies, and find no evidence for significant evolution over the 70-tim 
flux density range 10 mJy - 100 mJy. 
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1 INTRODUCTION 

There is a well-known relationship between the infrared and radio 
flux densi ties of star-forming galaxies (the 'infrared / radio correla- 
tion' , e.g. lHelouT Soifer & Rowa n^Robinsonlfl 9851 : lAppleton et al.J 
|2004) that is thought to be the result of both the infrared and ra- 
dio emission from star-forming galaxies being related to the rate of 
massive star formation. Recent st udies of the infrared / radio corre- 
lation within nearby galaxies (e.g. lMurphv et alj2006t) give weight 
to this argument, as the radio emission has been shown to repre- 
sent a smeared out version of the infrared emission. This has been 
attributed to the diffusion of cosmic ray electrons away from star- 
forming regions. The correlation is seen to apply to a wide range 
of galaxies associated with star-formation, but not to those sources 
whic h are associated wit h Active Galactic Nuclei (AGN) activity 
(e.g. Ide Jong et alj|l985l : ISanders et allll988l: ISopp & Alexander 
ll99l| - |Rovetal.||l998h . 

Studies at high redshift are tentatively finding that the in- 
frared / radio correlation does not deviate significantly fr om that 
seen i n the local Universe , at least out to z ~ 1 (e.g. iGarrettl 
|2002| ; lAppleton etai] |2004 [Fraver et al.l 120061) . and potentially 
to z ~ 3.5 Jlbar et al.l 120081) . The correlation has been shown 
to remain invariant over m ore than 5 orders of magnitude (e.g. 
lYun. Reddv & Condonll200lh . however studies that rely on sam- 
ples of sources which are detected at both infrared and radio wave- 
lengths will naturally be mainly sampling luminous galaxies, and 
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there has been relatively little work carried out on testing the in- 
frared / radio correlation within fainter galaxies. 

The technique of source 'stacking' is well established as a 
method for combining data from many individual objects in order 
to study the statistical properties of sources which would otherwise 
be below the detection limit for a particular survey. By construct- 
ing a list of source positions based on prior information obtained 
at another frequency, a series of small 'cut-out' images can be cre- 
ated, centred on these source positions, of sources that may be be- 
low the noise level (and therefore undetected). If N of these images 
are combined, each with a local noise of o", then the noise level 
of the stacked image will be expected to decrease approximately 
as a/vN, which quickly allows sources below the original detec- 
tion threshold of the survey to be studied. Stacking h as previously 
been carried out using optical (e.g. lZibetti et alj2005l) and infrared 
(e.g. lZheng et aT] l2006l . [2007h images, as well as in the radio. The 
large covera ge area of the Faint Images of the Radio Sky at Twenty- 
cm (FIRST; becker. White & Helfanj [l995h survey h as led to a 
number of stacking experiments being carried out (e.g. IWals et all 
l2005t Ide Vries et al.1l2007l : IWhite et alj[2007l : iHodge et alJl2008T) . 
looking at the radio properties of quasars and LL-AGN, and some 
studies of star-forming gala xies have taken place using smaller , 
but deeper radio surveys (e.g.lBovle et alj |2007; Ivisonetal hoot 
iBeswick et al J2008t ICarilli et al J2008I) . 

There have been two stacking studies of the infrared / ra- 
dio correlation for galaxies which are detected in the infrared, but 
are below the detection limits of their radio surveys. The strength 
of the infrared / radio correlati on can be quantified by the loga- 
rithmic flux density ratio 924 ( lAppleton et alj|2004l . and see Sec- 
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Table 1. A summary of the properties of the radio surveys used in this work. The area and resolution of each survey, and the noise level at the centre of each 
image are given, along with the primary reference paper, which should be consulted for further details. 



Field 


Instrument 


Frequency 


Area 


Resolution 


Position Angle 


Noise level 


Reference 








(deg 2 ) 


(arcsec 2 ) 


(deg) 


(|jjy beam ) 




xFLS 


VLA 


1.4 GHz 


4 


5.0x5.0 





23 


Condon et al. (2003) 


xFLS 


GMRT 


610 MHz 


4 


5.8x4.7 


60 


30 


Garn et al. (2007) 


ELAIS-N1 


GMRT 


610 MHz 


9 


6.0x5.0 


45 


40 / 7<£ 


Garn et al. (2008a) 


Lockman Hole 


GMRT 


610 MHz 


5 


6.0x5.0 


45 


60 ~ 


Garn et al. (2008b) 


ELAIS-N2 


GMRT 


610 MHz 


6 


6.5x5.0 


70 


90 


Garn et al. (2009) 



a Deep / shallow regions respectively. 



tion l2.3t . and the resu l ts from these studies appear to be inconsis- 
tent, with iBovle et al.l d2007l) finding a significantly higher value 
of g24 than is observed in source s which are detected in both wave- 
bands, an d lBeswick et"aD (2008) finding a significantly lower value 
of g24 , and a tentative evolution in the value of ^24 with 24-iim 
flux density. In this work we describe a stacking study of the in- 
frared / radio correlation, using the sensitive Spitzer Space Tele- 
scope dWerner et alj|2004h observations of the Spitzer extragalac- 
tic First Look Survey field (xFLS) an d the Spitzer Wide-a rea In- 
frared Extragalactic survey (SWIRE; lLonsdale et alj |2003) fields. 
The three northern SWIRE fields are the European Large- Area ISO 
Survey-North 1 (ELAIS-N1), -North 2 (ELAIS-N2) and Lockman 
Hole regions, and we take radio data fr om a 1.4-GHz Very Large 
Array (VLA) survey of the xFLS field dCondon et aitoosl) . and 
610-MHz Giant Metrewav e Radio Teles cope (GMRT) sur veys of 
the xFLS dGarn et alj|2007l) ELAIS-N1 dGarn et alj|2008al), Lo ck- 
man Hole dGarn etalj|2008bl) and ELAIS-N2 dGarn et alj|2009l . in 
prep.) regions. 

In Section [2] we describe the two stacking techniques which 
are used in the literature, and calculate the values of ^24* using 1.4- 
GHz data from within the xFLS field. In Section [3] we extend this 
work to the four 610-MHz survey fields, and demonstrate that AGN 
contamination is not a significant problem for our infrared samples. 
We compare the results from our four fields in Section [4] and find 
a field-to-field variation in the infrared / radio correlation, which 
is too great to be expl ained through cos mic variance. We compar e 
our results to those o f lBovle et alJd2007l) and lBeswick et alj (2008). 
and discuss potential explanations for the systematic variation. 



2 STACKING TECHNIQUES 

Previous stacking studies of the infrared / radio correlation have 
been carried out at 1.4 GHz, and in order to compare our results to 
those in the literature we initially stack infrared sourc es within the 
xFLS field using the 1.4-GHz image of the region ( Cond on" et al.l 
l2003h . A summary of the characteristics of this survey, and the 
other radio surveys used in this work, is given in Table [T] Posi- 
tional information for t he infrared sources was taken from the 24- 
|i.m source catalogue o f lFaddaetalJd2006l) , which contains 16,905 
extragalactic point sources above a signal-to-noise (S/N) of 5. The 
source catalogue is 50 per cent complete over the field at a 24-iim 
flux density of 300 pJy, with a flux density limit of 210 ujy, while 
a deeper verification region, covering approximately 10 per cent of 
the field, is 50 per cent complete to 150 (i.Jy, with a flux density 
limit of 120 ixjy. 

Of the 16,905 sources in the catalogue, 14,820 are located 



within the 1.4-GHz image. A small (~ 0.1 arcsec) positional off- 
set between the 24 u.m catalogue and the radio image was identi- 
fied and removed before performing any stacking. The xFLS 24- 
[im catalogue does not classify sources by type, however we will 
demonstrate in Section [3~2l that there is no significant contamina- 
tion in the catalogue from stars or AGN. There are two principal 
methods used in the literature to carry out stacking experiments, 
which are described in Sections[2.1landl2T2l 



2.1 Measuring the flux density of individual sources 

The first method for carrying out source stacking measures the ra- 
dio flux density at each of the individual source positions, bins the 
sources by their infrared flux density, and uses the statistical distri- 
bution of radio flux density for sources within an infrared flux den- 
sity bin to calculate the typical source properties. While the individ- 
ual measurements will be very uncertain, particularly for sources 
near to, or below the noise level , the statistical properties of the 
distribution should be robust (e.g. lWhite et al.|[2007T) . 

The resolution of the 1.4-GHz xFLS image is 5 x 5 arcsec 2 . 
In order to calibrate the flux density measurements, a circular aper- 
ture with radius of 4 arcsec was centred on the location of 79 mod- 
erately bright radio sources, and the total flux density within the 
aperture measured. An aperture correction factor was then calcu- 
lated, in order to match the meas ured flux densities o f the bright 
sources to their catalogued values dCondon et al.l2003l) . Any errors 
in flux density measurement will propagate through the remainder 
of the analysis, so the sources that were used for this calibration 
were required to be: 

(i) Brighter than 1 mly total flux density, so that sources were 
detected with high significance and the catalogued flux density 
measurements were known to be accurate. 

(ii) Fainter than 10 mjy total flux density. This requirement is 
very important when calibrating the aperture size for the GMRT 
images in later sections, sin ce > 10 mjy sour ces can be strongly 
affected by phase errors (e.g. lGarn et"ai]|2008ah . 

(iii) Within the central region of the mosaic, so that any edge 
effects and increasing noise levels due to the varying primary beam 
would not affect the flux density calibration. 

(iv) Unresolved, with a ratio of total to peak flux density < 
1.5. The faint star-forming galaxies being stacked are expected to 
be un resolved, with a typical size of ~ 1.5 arcsec dBeswick et al.l 
2008) and a stacking analysis would not be appropriate for ex- 
tended sources due to the different flux density distribution for each 
object. 
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The calculated correction factor for a 4-arcsec aperture was 1.37 
for the VLA xFLS image (with similar values found for the four 
GMRT images which are used later in this work). This correction 
factor has been calculated only from sources with flux densities be- 
tween 1 and 10 mjy, and may not be appropriate for fainter radio 
sources - if this is the case, then varying the aperture size (and 
hence the correction factor) will lead to different results. We tested 
aperture sizes between 2 and 10 arcsec - after applying the cor- 
rection factor, the flux density measurements for the faint sources 
were consistent with each other for aperture sizes between 2 and 
5 arcsec. At greater sizes, the radio flux density calculated for the 
faint sources began to increase. We used a 4 arcsec radius aperture 
for all stacking measurements described in this work, and all mea- 
surements made within the aperture have had the correction factor 
applied unless otherwise stated. 

The flux density measurement technique described in this sec- 
tion sums all of the radio flux density within a circular aperture, 
irrespective of its origin. There is the possibility that a 4 arcsec ra- 
dius aperture is sufficiently large that source confusion may be a 
problem (where the radio emission from more than one source is 
incorrectly assumed to all result from a single 24- urn target). In or- 
der to test for so urce confusion, w e used the Spitzer Infrared Array 
Camera (IRAC; iFazio et alj|20()4) observations of the xFLS field 
to identify nearb y counterparts to the 24-jxm sources. The IRAC 
source catalogue dLacv et alj|20050 contains 103,193 sources with 
detections in at least one of the 3.6-, 4.5-, 5.8- and 8-tr.m bands. 

We selected a sample of 9062 24- tun sources which were lo- 
cated within the centre of the IRAC observations, and measured 
the number of IRAC counterparts within 4 arcsec of each 24- ^m 
source. While some of the 24-tt.m sources do not have an IRAC 
counterpart (917; 10.1 per cent), the majority of the sources had 
a single IRAC counterpart within 4 arcsec (7712; 85.1 per cent), 
with only 409 sources (4.5 per cent) having two counterparts and 
24 sources (0.3 per cent) having three. We conclude that confusion 
from nearby faint sources will not significantly affect the radio flux 
density measurements. 

Fig.[TJa) shows the measured (aperture-corrected) 1.4-GHz ra- 
dio flux densities for each source in the 24- tun catalogue, along 
with ±4(7 (where 4(7 is the detection limit for individual radio 
sources). The figure is plotted on a log-linear scale in order to show 
the range of radio flux density that is measured, which can be nega- 
tive due to the effects of noise. The decreased number of sources in 
Fig-ERand later figures displaying xFLS data) which have a 24- tun 
flux density of < 300 |jJy is due to the smaller coverage area of the 
deep verification region of the survey. 

In order to look at the statistical properties of sources, we bin 
them logarithmically by their 24-jxm flux densities, and calculate 
the median infrared and radio flux density in each bin. The me- 
dian estimator is more robust to outliers than the mean, and we will 
demonstrate that the median is the most appropriate choice for this 
data in Section |2~2l We plot the median binned flux densities in 
Fig. [TJa), with distribution widths which denote the inter-quartile 
range (IQR) for each bin, a more robust measurement of the range 
than the standard deviation. 

For comparison, Fig.QJb) shows the measured radio flux den- 
sities from 14,820 random source positions in the image - a few 
locations have a measured flux density above the 4(7 detection 
limit and appear to have a source present, but the remainder of the 
random positions have measured flux density below the detection 
threshold. The median radio flux density for all bins is consistent 
with zero. 
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(a) Measured and median binned radio flux density for the 14,820 
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24-|-ui i sources within the xFLS 1 .4-GHz image of Con don et alj 
(2003). Note that the majority of sources are below the limit at 
which they could be detected individually, as shown by the upper 
dotted line, and would not be included in a conventional analysis. 
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(b) Measured and median binned radio flux density for 14,820 random 
source positions, with each position assigned a 24-|am flux density 
from the catalogue at random. The radio flux density in each bin is 
consistent with zero. 

Figure 1. The 1.4-GHz radio flux density for each source in the 24-|am 
catalogue, measured as described in Section ffj] (grey dots), the values of 
±4o" (horizontal black dotted lines) and the median values of flux density 
within each bin (black points with errors). The error bars denote the size 
of the IQR of radio flux density in each bin. The vertical striping seen in 
this and later figures is an artefact of the quoted precision of the 24-|am 
catalogue. The decrease in the number of sources seen below a 24- \xm flux 
density of 300 |xly is due to the smaller coverage area of the verification 
region - see text for more details. 



2.2 Creating stacked images 

The second method used in the literature to measure the flux density 
of sources via stacking is to create a 'stacked image' that is repre- 
sentative of the average properties of sources within each infrared 
flux density bin, and then calculate the radio flux density directly 
from that image. Small 'cut-out' images are generated, centred on 
each source within a flux density bin, and the value of each pixel in 
the stacked image calculated from the mean or median of the distri- 
bution of flux density that is seen from the equivalent pixels in the 
N cut-out images. 

In order to compare this technique to the one described in 
Section |2~T1 we create 41 x 41 pixel 2 (61.5 x 61.5 arcsec 2 ) cut- 
out images which are centred on each of the 24-iim sources 
within the xFLS field, and create mean and median stacked im- 
ages, using the same bins as in Section |2~T| The noise of the me- 
dian stacked images wa s calculated using SOURCE EXTRACTOR 
l lBertin & Arnoutsil 1996th and is a good fit to a 1 / \/N relationship, 
although with an initial noise level of 26 ttJy beam - 1 , 13 per cent 
larger than the value of 23 \iJy beam -1 quoted by Cond on et al.1 
(2003). The bin containing the most sources has a noise level of 
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(a) 150-201 pjy. (b) 201 -268 pJy. (c) 268-359 pJy. (d) 359-479 pjy. (e) 479-641 [jjy. (1) 641 -857 (jjy. 



Figure 2. Median stacked 1.4-GHz radio images for the faintest six flux density bins. All images have a size of 61.5 x 61.5 arcsec (41 x 41 pixel 2 ). The 
24- [am flux density range of sources used to create the stacked image are given below each sub-image, with further details in Table [2] The grey-scale ranges 
between —2 and 20 pjy beam -1 . 




(a) 150-201 pjy. (b) 201 -268 iiJy. (c) 268-359 pJy. (d) 359-479 i-Jy. (e) 479-641 [jjy. (f) 641 -857 (jjy. 



Figure 3. Mean stacked 1.4-GHz radio images for the same 24- ^un flux density bins shown in Fig. [2] As in Fig. [2] images are 61.5 x 61.5 arcsec 2 and the 
grey-scale ranges between —2 and 20 \xSy beam -1 , but the presence of a few bright sources in each stacked image make the resultant mean images no longer 
representative of the 'typical' sources within a flux density bin. 



Table 2. The infrared flux density range, number of sources in each bin N, 
median value of 1.4-GHz flux density Si. 4 and the noise level o~ for the six 
median stacked images shown in Fig. [2] 



Bin 


S24 


N 


SlA 


a 




(nJy) 




(nJy) 


( |ajy beam - ) 


1 


150-201 


466 


21.2 


1.21 


2 


201 - 268 


855 


25.2 


0.80 


3 


268 - 359 


2565 


30.5 


0.59 


4 


359 - 479 


4583 


39.0 


0.42 


5 


479-641 


2795 


53.7 


0.49 


6 


641 - 857 


1476 


72.2 


0.64 



424 njy beam , more than 50 times lower than the noise level of 
the VLA xFLS image, and ~ ten times lower than the most sensi- 
tive VLA observations to date. 

In Fig. [2] we show the median stacked images created for the 
six faintest bins, along with the range of 24-p.m flux density that 
the bin covers. The number of sources in each bin, the total flux 
density of the stacked source and the noise level of the stacked 
image are listed in Table [2] There is a clearly- visible point source 
seen in each image, with a circular appearance given by the 5 x 
5 arcsec 2 resolution of the original VLA mosaic. In contrast to this, 
Fig. [3] shows the mean stacked images for the same six bins. All 
images are noticeably noisier than their median equivalents, and 
bright sources away from the centre of the cut-out images have a 
much greater effect on the stacked images. The noise level of the 
mean images is ~ 1.5 — 3 times the noise of the median images, 
and the mean images are not representative of the typical sources 
within each flux density bin, but are strongly biased by a few bright 
radio sources. 

We calculated the radio flux density of the stacked images, and 
compared the results to the median radio flux density in each bin as 
calculated in Section |2~T| The median results are in good agreement 
with each other, as is expected for point sources and a Gaussian 
noise distribution - the flux density measured from the median im- 



age is equivalent to the median of the flux densities measured from 
the individual images. While the two methods for estimating the 
median radio flux density give equivalent results, creating stacked 
images means that the information on the IQR of the flux density 
distribution is discarded. We therefore use the median binned flux 
density within an aperture for all future flux density measurements 
in this work. 

2.3 Calculating median values of #24 

The infrared / radio correlatio n can be quantified th rough the loga- 
rithmic flux density ratio dAppleton et al.ll2004h . where 

qm = login fjj^j > (!) 

and Sir is the infrared flux density detected within either the Spitzer 
24-p.m or 70- pm bands. In Fig.|4{a) the value of 1^24 for each source 
in the 24-pm catalogue is shown, along with the 'median binned' 
value of q24, which is calculated directly from the median values 
of infrared and radio flux density within each flux density bin. The 
error bars on #24 come from the radio flux density at the IQR for 
each flux density bin, and therefore represent the same range as 
was shown in Fig.Q] Since sources with an apparent value of radio 
flux density that is negative (of which there are many - see Fig. [TJ 
cannot be plotted on this figure, the median value of #24 does not 
follow the trend of the individual plotted points directly - this can 
be seen by the asymmetric distribution range for the fainter flux 

dens ity bins. 

Appl eton et ai] d2004[) carried out the first study of the in- 
frared / radio correlation in the x FLS field, using a 1.4-GHz ra- 
dio catalogue dCondon et al]|2003h . and early 24-p.m Spitzer data. 
508 sources were fou nd with both 24-p.m and 1.4-GHz detections. 
Apple ton etal] d2004h calculated a mean value of q 1A = 0.84± 0.28 
for these sources, which is shown on Fig.f4ja) for comparison with 
our dat a. Note that the i nitial survey data which has been used for 
both the lAppleton et al.l d2004h study and the study described in this 
section is the same. It is clear t hat our stacking results give a value 
of #24 that is greater than the lAppleton et alj d2004t) mean value 
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(i.e. we find that sou rces are more radio-quiet than was found by 
Apple ton et alj 2004 ), wh ich is to be expected due to the fact that 
the Appleto n et alJ d20"04T) sample contained only those sources de- 
tected at both frequencies, and is therefore subject to significant 
sample bias. 

In order to test the effects of this bias, we created a catalogue 
of 1.4-GH z radio sou r ces in the xFLS field, using the method de- 
scribed in lGarn et al. I J2007t) . and requiring that sources could be 
detected at the 4o~ level. We then repeated the stacking experiment 
described above, using only the 2,558 24-jxm sources which were 
also present in this 1.4-GHz catalogue. Fig. |4fb) shows the indi- 
vidual values of 1724 f° r eacn source, calculated using the method 
described in Section |2~T1 along with the median binned values of 
<?24- The 4o~ limit is shown on this plot, although some sources 
have a measured flux density below this limit (i.e. a value of ^24 
above the dashed line) due to the effects of noise on the aperture 
flux density measurements. An artificial variation of 1^24 with in- 
frared flux density is now seen, although this is purely a selection 
effect and clearly demonstrates the disadvantage of only studying 
the bright sources which can be detected in radio images. Now that 
the median values of ^24 are being calculated from sources that can 
be clearly seen above the noise, the binned data follows the trend of 
individual sources and the flux density distribution is both narrower 
and more symmetric. 

A comparison of the I Appleton et alj J2004h results and the 
binned data from detected sources in Fig. |4fb) shows good agree- 
ment, with th e binned results spa nning the width of the distribu- 
tion found by Applet on et alj d2004l) . This suggests that the stack- 
ing method is not significantly biasing the radio flux density which 
is measured for sources. We overlay the valu es o f 1774 found in 
the sta cking experiments of iBovle et alj J2007h and lBeswick et al.l 
(2008) on Fig . [4] fo r comparison with our data. Note that the 
iBeswick et alj 12008) results are not suffering from the selection 
effect which was described above, resulting from only using de- 
tected sources, despite appearing to follow a similar trend to our 
binned data in Fig. |4fb) - 80 per cent of their infrared sources can 
be detected above 3o" in their radio image, and all sources have 
been used for the stacking experiment (see Section l4~6l for further 
details). All three stacking experiments find very different values 
for ^24> with the sources in the study of IBovle et alj (|2007l) being 
more r adio-quiet than our data, and the sources in the lBeswick et alj 
(2008) study being m ore radio-loud. A detai led comparison of our 
results with those of IBovle et"aH J2007t) and lBeswick et alj d2008h 
will be performed in later sections. 
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(a) qiA, calculated from all 14,820 sources in the 24-|am catalogue. 
There is no significant variation in the median value of over the 
range of S24 plotted. 




10- 



0.01 



3 x 10~ 4 0.001 0.003 
24-/jm flux density (Jy) 
(b) ^24, calculated only from the 2,558 sources that are also detected 
above 4o~ at 1.4-GHz. The median value of ^24 follows the trend of 
individual sources. The diagonal dashed line represents a constant 
radio flux density of 4 x 23 |xly. Bins containing fewer than 20 
sources have been excluded from the figure. 

Figure 4. The effects of varying the selection criteria when calculating 1724 ■ 
Values of 1724 for individual sources are shown (grey dots), along with the 
median binned value of qn (black points with errors denoting the IQR of 
the data). Note that sources with an apparent radio flux density < cannot 
be shown on these plots. The decrease in the number of sources seen below 
a 24- |am flux density of 300 |ajy is due to the smaller coverage area of the 
verification survey - see Sectionf2]for more details. The values of 524 found 
in the stacking experiments of lBovle et al . 12007) (thick pink dashed line; 
CDFS field on l y, 'all -source' data - see Section PTBI for further details) and 
IBeswick et alj J2OO8I) (thick orange solid line) are plotted for comparison. 
The hori zontal black lines repr esent the mean value of 1724 of 0.84 ± 0.28 
found bv lAppleton et all QPiJjj) - see text for more details. 



3 610-MHZ STACKING 

3.1 Spitzer extragalactic First Look Survey field 

The strength of the infrared / radio correlation can be quantified at 
610 MHz in a similar way to at 1.4 GHz, using 

9m = lo gio (|^) > (2) 

where Seio is the radio flux density measured at 610 MHz. There 
are 13,812 24-p.m sources located within the r egion covered b y 
the 610-MHz GMRT image of the xFLS field jGarn et alj|2007h . 
While the noise level of the 1.4-GHz image is fairly uniform across 
the field, the GM RT image shows more variation (see Fig. 1 of 
iGarn et alj|2007l) . In order to test whether this varying noise af- 
fects the calculated values of q' 24 we carried out the stacking pro- 
cedure described in Section |2~T1 on the GMRT image, using only 



those sources within a radius of 25 arcmin from the centre of the 
mosaic. This analysis was repeated in 12.5 arcmin increments up 
to 1.25 deg - increasing the radius had no significant effect on the 
median values of q 24 , indicating that the varying noise level within 
this image was not affecting the results. 

We varied the aperture size, and verified that a 4 arcsec aper- 
ture was still appropriate for the GMRT image. Fig. [5J a ) shows the 
610-MHz flux density measured for each source, and the median 
binned values, for comparison to Fig.[TJa). Due to the increased and 
varying noise level across the 610-MHz image, there is a greater 
dispersion in the individual values of flux density being measured, 
and the IQR of radio flux density for each bin is larger. This is more 
clearly seen in Fig.[5jb), where the flux density measured at 13,812 
random source positions is plotted - since the noise level is not uni- 
form, the individual flux density measurements do not lie as tightly 
inside the ±4o~ lines as before (where o~ = 30 [jJy beam -1 , the 
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(a) Measured and median binned radio flux density for the 13,812 
24- |am sources within the xFLS 610-MHz image. 
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(b) Measured and median binned radio flux density for 13,812 random 
source positions, with each position assigned a 24-|am flux density 
from the catalogue at random. The radio flux density in each bin is 
consistent with zero. 

Figure 5. The 610-MHz radio flux density for each source in the 24-|am 
catalogue, measured as in Section l2~T1 (grev dots), the values of ±4o~ (hori- 
zontal black dotted lines) for the central region of the image, and the median 
values of flux density within each bin (black points with errors). The error 
bars denote the size of the IQR of radio flux density in each bin. The de- 
crease in the number of sources seen below a 24- |om flux density of 300 |oJy 
is due to the smaller coverage area of the verification survey - see SectionfS] 
for more details. 



noise at the centre of the 610-MHz image). The median of the dis- 
tribution is again consistent with having zero flux density in each 
bin. 



3.2 Stacking by source type in the SWIRE fields 

We have carried out no selection by source type so far in this work, 
and have been stacking all 24-p.m sources together, rather than just 
those specifi cally associa t ed wi th star formation. The 24- [tm xFLS 
catalogue of iFadda et ail J2004h does not classify sources by type, 
and it is possible that there may be a significant number of stars 
or AGN in the sample, potentially affecting conclusions that are 
being drawn for the star-forming galaxy population. At 30 mjy the 
contribution of stars and galaxies to the 24- [im source counts is 
roughly the same, but galaxies rapidly dominate below this flux 
density, and stars do not make up a significant fraction of the source 
counts over the 24 - ^tm flux density range that we are considering 
dShupe et alj2008l) . 

We carried out source stacking in the three northern SWIRE 
fields, using the GMRT 610-M Hz images of the ELAIS-N1 
dGarn et alj2008 at), L ockman Hole jGarn et al.l2008bh and ELAIS- 
N2 jGarn et ai] |2009) fields, and obtaining positional information 
on infrared sources from the photometric redshift catalogue of 
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Figure 6. Median values of q' 24 for sources within the ELAIS-N1 field, 
along with the IQR of the data. All sources from the Rowan-Robinson et al. 
12008) photometric catalogue are shown (thick black points), compared 
with sources that have been classified as star-forming (thin red points). The 
star-forming sources have been displaced by a small amount along the x- 
axis for clarity, but would otherwise lie directly on top of the median value 
calculated from all sources. No significant difference is seen between the 
two median values, although the distribution width is smaller for the star- 
forming sources, demonstrating that AGN contamination increases the un- 
certainty of the results. 



Rowan-Robinson et "aH d2008h . While the photometric catalogue is 
not completely unbiased, due to the selection requirements and 
the need for sufficient photometry to estimate redshifts, all sources 
within the catalogue have been classified as either galaxies or AGN 
(with no stars being present). The possibility that the xFLS sam- 
ple is contaminated by significant numbers of sources that are not 
star-forming galaxies can thus be tested. 

There are 48,882 entries in the photometric catalogue with 
24-p.m flux densities between 150 pjy and 10 mJy, within the 
region covered by the ELAIS-N1 image. Of these sources, 2,236 
(4.6 per cent) have been classified as fitting AGN templates through 
their optical and infrared photometry, although no information is 
available as to which sources may have a significant amount of 
their radio emission resulting from an AGN, but are not identi- 
fied as such in the infrared. Fig. [6] shows a comparison between 
q 24 calculated from all sources within the ELAIS-N1 field, and 
q' 24 from just those sources classified as star-forming. No signifi- 
cant difference is seen in the median value, with the values for the 
star-forming sources lying on top of the median values found when 
using all sources in the catalogue. The IQR decreases when only 
star-forming sources are used, demonstrating that the AGN sources 
increase the dispersion in the data, but that the median is resis- 
tant to the presence of a small percentage of outliers. The greater 
noise level for the ELAIS-N1 survey compared with the xFLS 610- 
MHz survey (~ 7 [ijy beam -1 across the majority of the image; 
iGarn et al]|2008al) means that the IQR can permit negative radio 
flux densities for some of the fainter infrared flux density bins - 
where this occurs, lower limits are shown for the IQR of q' 24 in 

Fig.m 

Repeating this test on the other two SWIRE fields, similar re- 
sults were found. The presence of known AGN sources in the sam- 
ple has very little effect on the stacking results, and we conclude 
that contamination from AGN sources in the xFLS sample will be 
unimportant. However, all future stacking within the SWIRE fields 
will use only those 24- pm sources identified as star-forming, with 
46,646 sources in ELAIS-N1, 32,265 in ELAIS-N2 and 28,042 in 
the Lockman Hole. 
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Figure 7. The median values of q'~> 4 from the xFLS (red upright crosses) and 
three SWIRE fields (ELAIS-N1 - blue stars; ELAIS-N2 - green triangles; 
Lockman Hole - grey squares). The error bars represent the error on the 
value of the median radio flux density, rather than the IQR. 



4 DISCUSSION 

4.1 SWIRE and xFLS comparison 

In Fig.UJwe show the 610-MHz stacking results from the xFLS and 
SWIRE fields. There is an apparent field-to-field variation seen in 
the values of q' 24 , with the xFLS data being systematically lower 
by ~ 0.3 than the SWIRE data. The offset affects all of the 24- 
\im flux density bins by approximately the same amount. The error 
estimates come from the uncertainty in median radio flux density, 
which scales as 1 /\/N and is equivalent to the error obtained from 
the noise level of a stacked image. Due to the dependence on \/~N, 
the fainter flux density bins (which contain many more sources) 
appear to be much better constrained than the brighter bins, where 
sources can actually be detected individually. 

There are a number of potential explanations for the offset 
which is seen: 

(i) A systematic offset in the flux density calibration in one or 
more of the radio images (Section[42]l. 

(ii) A systematic offset in the flux density calibration in one or 
more of the infrared catalogues (Section[43]t 

(iii) A bias being introduced into the stacking experiments due 
to sample incompleteness (Section r4.4t . 

4.2 Radio flux density calibration 

Any systematic flux density calibration offset in the radio surveys 
would directly translate into a systema tic offset in the calculat ed 
value of q 14 . All of the GMRT surveys dGarn et alj|2007l.l2008j|bl 
2009) were calibrated in the same way, using observations of 3C48 
or 3C286. The relative flux density calib ration of the xFLS survey is 
accurate to 7 per cent dGarn et al .120071) , and similar values apply to 
the three SWIRE survey fields. The effect of a constant calibration 
error is simple to quantify - if the measured radio flux density S m 
equals fx the true flux density S tme , then the measured value of q m 
is related to the true value gtrue via 

?m = ?true-logi (/)- (3) 

A 7 per cent flux density calibration error would lead to an error 
in q24 of ±0.03 - ten times too small to account for the difference 
seen in Fig.|7j 

The existence of a systematic calibration offset between the 
radio surveys can be tested for by considering the only those values 
of q' 24 which are calculated from infrared sources with counterparts 
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Figure 8. A comparison of q' 24 for each of the GMRT fields, using the de- 
tected radio sources only, with error bars representing the error on the me- 
dian value of radio flux density. The colour scheme is the same as for Fig. [7] 
While the varying biases affect the values of q' 24 found for faint infrared flux 
densities, no systematic offset is seen between the values of q' 24 in the four 
fields for bright sources, in contrast to the result seen in Fig. [7] when all 
infrared sources are used. 

in the GMRT catalogues from iGarn etail J2007L [2008allbl l2009h . 
Fig.[H]compares the median values of q' 24 from each of the GMRT 
images, using only those detected sources - while the results for 
faint infrared flux density bins are affected strongly by the same 
selection effects as were seen in Fig. Etb), no systematic offset is 
seen between the median values of q 24 for bright infrared sources, 
in contrast to that seen in Fig.|7j The existence of a systematic offset 
between the stacked values of q' 24 in the xFLS and SWIRE fields, 
which is not seen in the values of q' 24 calculated from sources which 
are detected in the radio images, rules out the possibility that the 
offset is a result of a difference in flux density calibration between 
the radio surveys. The median value of q' 24 from bright, detected 
sources for all fields is ~ 1 in the brightest 24-u.m bin, which is in 
agreement with the value found for the xFLS field using all 24- 
sources. 

An error due to the radio flux density calibration can also 
be ruled out by c onsidering the radio source counts presented in 
IGarn et all d2008al) - if the xFLS field was systematically brighter 
than the SWIRE fields, then the 610-MHz differential source counts 
would be expected to show a systematic offset between the xFLS 
and SWIRE fields; no such effect was seen. The GMRT source 
counts also agree with GMRT counts made by other authors, con- 
firming that the relative flux density calibration of the surveys is 
accurate. It seems unlikely that any large-scale flux density calibra- 
tion error can be the cause of the discrepancy between the xFLS 
and SWIRE survey data. 



4.3 Infrared flux density calibration 

The 24-jj.m catalogues for the xFLS and SWIRE fields have been 
created by separate authors (xFLS; iFadda et al.1 12004 SWIRE; 
Rowan-Ro binson et alj|2008h . While it is equally unlikely that any 
large-scale flux density calibration error in the infrared could be re- 
sponsible for the systematic effect which is seen, this can be tested 
through a stacking experiment in the xFLS and SWIRE fields using 

70-|i.m sources. 

There are 601 sources in the iFraver et alj ([2006!) xFLS cata- 
logue contained within the 610-MHz image, and 1028, 597 and 687 
sources from the Sura ce et al. I {US) SWIRE catalogues within the 
ELAIS-N1, ELAIS-N2 and Lockman Hole images. We carried out 
the same stacking procedure, using 70- (im source positions, in or- 




1 I ' ' ' ' — ' — ' — ■— I 

0.01 0.02 0.03 0.06 0.1 

70-fim flux density (Jy) 

Figure 9. Median values of q' 10 , along with errors in the median, taken from 
the four GMRT surveys. The colour scheme is the same as for Fig. [7] The 
ho rizontal black lines rep resent the mean value of q-jo = 2.16±0.17 found 
bv lAppleton et al.l 120041) . converted to a 610-MHz value using Equation [4] 
and a = 0.4 - see Section l4~4l 



der to compare the results to the 24-p.m data. Fig.[9]shows the q' 1Q 
values for the four fields. The error bars on q' 1Q are greater than 
before, due to the decreased number of sources in the infrared cata- 
logues. The same field-to-field variation is seen, with a similar sys- 
tematic offset of ~ 0.3 between the xFLS and SWIRE median val- 
ues of q' 10 , and no variation in the value of q' 10 with 570 is seen over 
the range 10-100 mjy for any of the survey fields. The similar off- 
set seen in q' 24 and q' in implies that the offs et is not a result of a dif- 
ferenc e between the lFadda et aljj2004t) and lRowan-Robinson et al.l 

d2008h 24- ^.m catal ogues. 

We overlay the Applet on et~ai1 d2004l) result of q 10 = 2.16± 
0.17 on Fig. [9] converted to a 610-MHz value using Equation [4] 
and a = 0.4 (see Section l4~4t . The lower sensitivity of the 70-jJ.m 
data compared with the 24-p.m observations (~ 30 mjy compared 
with ~ 0.3 mjy) makes the sample bias that occurs from only us- 
ing sources that can be detected in t he radio less significan t than 
at 24-(i.m. As was seen at 24-p.m, the I Appleton et all 12004) result 
agrees well with the xFLS stacking experiment, al though our data 
covers a grea ter range of lQ-\im flux density than Applet on et al.l 
d2004l) . as the lFraver et ail J2006h source catalogue of the field was 
not yet avai lable. The good agreem ent between our stacked q' 1Q val- 
ues and the lAppleton et al.l ([2004) value is further evidence that the 
stacking procedure is not significantly biasing the radio flux density 
measurements. 



4.4 Stacking bias 

IWhite et all d2007l) carried out a stacking experiment using the 
FIRST survey, and ran simulations to test how well their stack- 
ing experiment could recover the flux density of sources below the 
noise level. They placed four artificial sources in their uv data for 
each of 400 FIRST observations, and re-reduced the images to fully 
simulate all of the potential sources of bias that may enter into the 
data reduction procedure. They found that some of the initial flux 
density could not be recovered from stacked images, and attribute 
the lo ss to an effect equivalent to 'CLEAN bias' (e.g. lBecker et al.l 
1 19951) . despite the fact that sources below the survey threshold 'by 
definition, have not been CLEANed' (although see Section |4~5l for 
a disc ussion of the stacking simulation c arried out bylBovle et al.l 
2007). However, the second technique that l White et alJd2007l) used 
to test for a loss of flux was to take existing deeper observations 
of an area covered by FIRST (the VLA xFLS survey) to provide 
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Figure 10. A comparison between 924 obtained from the 1.4-GHz image 
of the xFLS field (black circles) and q' lir from the 610-MHz image of the 
same region, shifted to a 1.4-GHz value through the use of Equation|4]and 
a = 0.4 (red upright crosses). Error bars represent the error on the median 
value of radio flux density. The shape of the two profiles is consistent across 
the full range of infrared flux density, ruling out any significant insteument- 
dependent or data reduction-dependent biases. 



positions and flux densities for sources below the FIRST detection 
threshold, and then stack sources using the prior knowledge of their 
radio flux density. They found that their stacked images were able 
to recreate 71 per cent of the true flux density of sources, for all flux 
density bins. If a constant fraction / of flux is being recovered for 
all sources, then the required correction factor is also just a constant 
fraction (in this case 1/0.71 = 1.41) for each infrared flux density 
bin, rather than varying in a complex manner from source to source. 
The overall effect of this bias would be to alter the absolute value 
of q24 which is measured, but without changing its dependence on 
in frared flux density. From Equation [3] the correction factor found 
bv lWhite et al.l d2007D would have led to a calculated value which is 
0. 15 above the true value of <?24- In order to obtain an offset of 0.3, 
the correction factor would have to be log;o(0.3) ~ 2, with only 
half of the radio flux density of stacked sources being correctly 
measured. 

The linearity of any potential stacking bias can be directly test- 
ing within the xFLS field, where the same 24-p.m catalogue has 
been used to stack both 1.4-GHz and 610-MHz sources. Any non- 
linear form of bias would be expected to affect the two radio images 
in different ways, due to the separate telescope arrays used to take 
the observations, the independent data reduction methods, and the 
different amount of CLEANing. The relationship between ^24 an d 
<?24 , taking a single spectral index a to represent the whole popula- 
tiorQ, is 

424 = <724 - lo gio ( 1^- ) =g 2 4 + -36a. (4) 

\J610/ 

By comparing the binned values of 524 an d I24 found in the 
xFLS field, we are able to obtain a single spectral index, a = 0.4, 
which represents the linear shift required to convert between the 
stacked 610-MHz and 1.4-GHz flux densities. Fig. 1101 shows the 
values of <?24 calculated from the xFLS 1.4-GHz image, and the 
values of q' 24 calculated from the xFLS 610-MHz image, converted 
to 1.4-GHz values using a = 0.4. The shape of the 524 anu (shifted) 
^24 data are consistent across the full range of infrared flux den- 
sity, with the exception of the faintest flux density bin. While the 

1 We define a such that the variation of flux density S v with frequency V 
isS v =S v- a . 
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value of a has been chosen in order to overlay the ^24 an d <?24 
profiles, making the absolute values shown in Fig.[lO]not indepen- 
dent of each other, the consistent shape of the 1724 an d <?24 P ro " 
files in Fig.llOldemonstrates that any stacking bias which is present 
must be a si mple fraction of the true flux density, as was found by 
IWhite et al.l d2007l) . and rules out any significant non-linear flux- 
dependent stacking bias. 



4.5 Comparison with the results of lflovle et alj d2007l) 

The stacking experiment of iBovle et al, I d2007l) is similar to that 
presented in this work, also using 24-p.m sources detected in the 
SWIRE survey, in the Chandra Deep Field South (CDFS) and 
ELAIS-S1 fields. The infrared observations of the ELAIS-S1 field 
were much less sensitive than the CDFS observations, and the re- 
sults from the two fields were broadly similar, so we only compare 
our results to those from the C DFS field. 

The IBovle et all j2007h radio data came from a 1.4-GHz 
Australia Telescope Compact Array (ATCA) survey, with an rms 
noise level of 30 pJy beam -1 and resolution of 11x5 arcsec 2 
dNorris et alj|2006h . The Spitzer catalogue of the CDFS field con- 
tains ~ 12,000 sources above an infrared 5 a flux density limit of 
100 [jJy (while the ELAIS-S1 survey only contains ~2,000 sources 
above a flux density limit of 400 [jJy). Median stacked radio im- 
ages were created from the binned CDFS sources, and the peak 
flux density of these images was taken as their estimate of the radio 
flux density. They select data in two ways - an 'all-source' sample, 
where all the data was used, and a 'quiet-source' sample, where 
cut-out images that had a central pixel with radio flux density of 
> 100 pjy were not used to create the stacked image. Due to the 
exclusion of visibly-present radio sources, the 'quiet- source' stack 
suffers from a systematic bias in the brighter flux density bins, and 
underestimates the radio flux density for the stacked sources (as 
can be seen in the ir Fig. 4) - for this reason, all comparisons to 
IBovle et al] d2007l) data in this work are with the 'all-source' sam- 
ple. 

IBovle et al] |2007) found a value of ^24 = 1-39, and no sig- 
nificant variation with S24 over t he range 100 - 280 (jJy. Fig. [TTI 
shows the values of ^24 found by IBovle et alj J2007I) in the CDFS, 
and the values of from the three 610-MHz SWIRE fields, con- 
verted to 1 .4-GHz values using the spectral ind ex of a = 0.4 found 
from the xFLS comparison in Section |4~4l The lBovle et al.l ( 2007) 
results agree with the general trend of the 610-MHz SWIRE re- 
sults, although the data in individual flux density bins are not al- 
ways completely consistent. The errors on the fainter flux density 
bin s from the 610-MHz SWIRE fields are significantly smaller than 
the IBovle et alj ^2007) dat a, due to the increased num ber of 24- 
(j.m sources present in the Rowan-Robinson et al. ( 2008) data com- 
pa red with the earlierlSurace et alj d2005l) SWIRE catalogues used 
bv lBovle et ail d2007h. 

iBeswick et al. 1 720081) argued that some of the discrepancy 
between their resu lts (^24 = 0.48; see Section |4~6l and those of 
IBovle et al] d2007l) could be due to a systematic under-estimation 
of the radio flux density by up to a factor of 2 by the ATCA for 
sources a t low flux densities . This would lead to an over-estimation 
of q24 bv lBovle et alj {2007) of ~ 0. 3, reducing their value to ~ 1.1 
in the faint 24- [im flux density bins (and therefore agreeing with 
our xFLS data). However, our findings are not consistent with this 
interpretation, wi th our SWIRE stacking results agreeing with the 
IBovle et al] d2007h data after using a conversion between 6 1 0-MHz 
and 1.4-GHz result s calculated from th e xFLS field, independent 
of the SWIRE data. iNorris et alj d2006t) found through comparing 



their radio data to a more sensitive surv ey of the GOODS-South 
portion of the region dAfonso et~ai]|2006l) that the ATCA flux den- 
sities were only being under-estimated by about 14 per ce nt, which 
would lead to an over-estimation of the lBovle et alj q2007t) value of 
<?24 by ~ 0.07 - this amount of variation is within the typical field- 
to-field dispersion seen from the three 610-MHz SWIRE surveys, 
and cannot be ruled out. While any underestimation of ATCA radio 
flux densities has been shown to be small at high flux density, this 
effe ct may be more signifi cant for lower flux density sources - see 
e.g. IPrandoni et al] (2000) - which would affect the faint stacked 
sources (< 130 (i-Jy) studied by Boyl e et all d2007l) more signifi- 
cantly than the br ighter discrete objects (> 200 pJy) discussed by 
INorris et alj d20061) . It remains unclear as to the potential size or 
cause of any systematic underestimation of radio flux densities in 
ATC A observations. 

IBovle et alj (2007) recognized that their value of ^24 =1-39 
was unexpectedly high - 'greater than any modelled SED ', and so 
carried out simulations to test for any effects such as the stacking 
bias described in Section [4~4| They inserted sources into their im- 
age, and confirmed that they were able to recreate the median value 
of the source population through the use of stacking. More impor- 
tantly in the context of this work, they inserted sources into their 
uv data and reprocessed the images to fully simulate all of the ef- 
fects which could be biasing the stacked flux density values. Their 
stacking measurements gave them 'an essentially identical result' 
to simulations in the image plane, and to their original data, which 
suggests that any stacking bias in their experiment must be small. 

The fact that the same conversion facto r which was found 
for the xFLS data (of a = 0.4) also allows the lBovle etall d2007l) 
and 610-MHz SWIRE data to agree implies that there can be little 
stacking bias in the 610-MHz surveys. Any significant bias which 
only applied to the GMRT xFLS image would lead to a value of a 
which would not be suitable for the SWIRE fields, contrary to what 
is seen. We do not believe that a stacking bias can be the cause of 
the observed discrepancy between the xFLS and SWIRE results. 



4.6 Comparison with the results of lBeswick et alj 1 2008) 

IBeswick et alj (|2008) performed their stacking experiment on a 
sample of 377 24-iim sources detected with Spitzer in the Hub- 
ble Deep Field-North (HDF-N), 303 of which can be seen above 
3cr in a deep Multi-Element Radio-Linked Interferometer Network 
(MERLIN) and VLA image of the field with rms noise level of 
3.6 nJy beam -1 and resolution of 0.4 arcsec. They measured radio 
flux densities using both of the techniques describ ed earlier, and 
obtain ed comparable results with the two methods. IBeswick et alj 
(2008) found a median value of 524 = 0.48 for all sources, and a 
'tentative deviation in the infrared / radio correlation at very low 
24-p.m flux densities, with the fainter sources having lower values 
of g24- Fig. II II shows all of the stacking results presented in this 
work, conv erted to a 1 ,4-GHz valu e where appropriate, along with 
those from lBovle et alj d2007l) and IBeswick et al j d2008l) . In order 
to display the data more clearly we focus on the 24-p.m flux density 
range between 0. 1 and 2 mJy. 

There is a clear discrepancy between the results of 

IBeswick et al. I d2008h . the xFLS results, and the results from the 
SWIRE surveys. The most noticeable effect is that the radio sur- 
ve ys with the lowest no ise levels show the lowest values of <?24 _ 
the IBeswick et alj d2008l) survey is the deepest, at 3.6 pJy beam -1 , 
while the xFLS surveys are the next most sensitive, and the SWIRE 
fields are the least sensitive. However, the noise levels of the 
VLA xFLS survey (23 nJy beam -1 ) and the ATCA CDFS sur- 



10 T. Gam & P. Alexander 




icr 4 3 x icr 4 icr 3 



24-jum flux density (Jy) 

Figure 11. A comparison of all of the stacking results discussed in this 
work, using data shifted to 1.4 GHz where appropriate through the use 
of Equation [4] and a = 0.8. The stacking results from this work are given 
by thin data points with error bars, with the VLA xFLS image (black cir- 
cles), GMRT xFLS image (red upright crosses), GMRT ELAIS-N1 image 
(blue stars), GMRT ELAIS-N2 image (green triangles), and GMRT Lock- 
man Hole image ( grey squares) being shown, along with results from the 
iBovle etaljj2007h ATCA CDFS image (thick pink points; 'all-source' data) 
and the lBeswick et alj 120081) MERLIN+VLA HDF-N image (thick orange 
solid line). All error bars represent the error on the median value of radio 
flux density. 
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Figure 12. Histograms of the flux density recorded within N random aper- 
tures in each of the 610-MHz surveys. No aperture correction factor has 
been applied to the measured flux densities. 



vey (30 pJy beam ) are much more similar than the varia- 
tion in noise between the 610-MHz xFLS and ELAIS-N2 images 
(30 pJy beam -1 and 80 pJy beam -1 ), and yet the stacking results 
from those fields are still clearly inconsistent with each other, sug- 
gesting that this cannot be the only factor which is important. 

In order to test whether the stacking results are being biased 
by the noise level of the radio image, the distribution of flux density 
within N random apertures was calculated for each of the GMRT 
images, and shown in Fig. [12] No aperture correction was applied 
to these flux density measurements, since applying the aperture cor- 
rection only broadens each distribution by a factor of between 1.4 
and 1.5, and does not affect the conclusions of this section. The 
slightly varying noise levels near to the edge of each image mean 
that a single Gaussian can not completely represent the flux density 
distribution, but there is no evidence for any asymmetry or signifi- 
cant non-Gaussianity in the measured values of flux density for any 
of the fields. The effects of noise on the stacking procedure will not 
systematically bias the measured values of radio flux density, and 
can not be responsible for the systematic difference in q' 24 which is 
seen between fields. 

There may be a significant number of radio-bright AGN in 
the HDF-N sample, which would lead to an decrease in ^24- Any 
AGN contamination would have to be large, since the 4.6 per cent 
of AGN sources within the ELAIS-N1 field made essentially no 
difference to the calculated value of q' 24 in Section [3~2l and the 
median is naturally resistant to being affected by small numbers 
of outlier sources. All of the stacking studies presented here are 
infrared- selected, and the population would therefore be expected 
to be dominated by star-forming systems - this makes it unlikely, 
although not impo ssible, that large amoun ts of AGN contamination 
will be a problem. iBiggs & Ivisonl J20061) have shown that the dif- 
ferential number counts of radio sources within the HDF-N are con- 
sistent with source counts from other deep observations - although 
this does not provide any information directly about the brightness 
of infrared-selected sources, it does show that the HDF-N is not an 
anomalously radio-brig ht region of sky. 

80 per cent of the lBeswick et al.1 {2008) sample are detected in 



the radio above 3o", making it much more complete than the sam- 
ples described in this work (the deepest of which has 2,558/14,820 
= 17 per cent radio detections above 4(7). It is important to note that 
we have applied no radio detection criteria to the stacked samples (a 
major advantage of stacking experiments), so no bias should be en- 
tering the analysis thro ugh incompleteness - however, if the 'stack- 
ing bias' identified by Whi te et al.l ( 120071) did vary with flux den- 
sity, or S/N, then a more incomplete sample may lead to a greater 
value of <224 being found. Either way, in order to change a value 
of ^24 = 0.48 to 1.39 (from Beswick to Boyle) would require from 
Equation [3] that on ly 12 per cent of th e true radio flux density was 
being measured bv lBovle etal] ( l2007h . while Beswick et ai]( l2008h 
were measuring the full value - it seems highly unlikely that such 
a large error could be occurring. There may be some stacking bias 
taking place, but we do not believe it can be responsible for the 
size of the observed difference between the IBeswick et aT] (2008) 
results and the other data. It would also not explain the difference 
betwee n the essent i ally c onstant value of <j24 found in this work 
and by Bovle et al. (2007), and the variation with 24-|j.m flux den- 
sity found by Beswick et al. ( 2008). 

The IBeswick et al.l {2008) sample is the only stacking study 
which is working with resolved radio sources - however, the flux 
density measurement method that they use should include all of 
the resolved flux density. Any loss of radio flux would lead to an 
increase in </24, rather than the lower value that they find compared 
with other works. 

None of the explanations for the discrepancy in stacking re- 
sults are satisfactory - while we believe that the most likely expla- 
nation is some combination of an effect due to the differing noise 
levels and a stacking bias, this remains speculation. Nonetheless, 
all the indications are that the same effect is occurring equally to 
all radio sources in a field, and so while the absolute values of (724 
and <^24 are uncertain, their dependence on infrared flux density will 
not be affected. We find no evidence for any variation in the value 
of 1724 or <?24 over me 24-jrm flux density range of 150-fi.Jy and 
10 mJy, and no evidence for a variation in the value of q'-j^ over the 
70- (xm flux density range 10 - 100 mJy. 
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5 CONCLUSIONS 

We have compared the two methods commonly used in stacking 
experiments, and demonstrated that they give comparable measure- 
ments of the average radio flux density for binned sources. We have 
shown that the median flux density is a much more reliable esti- 
mator than the mean, and that the noise level of stacked images 
decreases as l/y/N, at least until a depth of ~ 0.5 [iJy beam -1 at 
1.4 GHz. Creating stacked images leads to less information being 
retained on the flux density distribution of sources, and we argue 
that future stacking experiments should not make images, but in- 
stead work directly from the individual flux density measurements 
that can be calculated at each source position. 

We have calculated ^24 an d q'24 from sources within the xFLS 
field, and demonstrated that they can be related to each other with 
the simple assumption that all radio sources have the spectral index 
of a = 0.4, over the 24- [im flux density range of 150 |j.Jy - 10 mjy. 
Using this convers ion, we have compared the stacking results of 
iBovle et alJ d2007h to the results obtained from our three SWIRE 
field images, and demonstrated that they are consistent, within the 
errors that can be placed on the median stacked values of q24- There 
appears to be some field-to-field variation seen between the xFLS 
and SWIRE fields, with stacking results from the xFLS field lead- 
ing to a value of ^24 that is ~ 0.3 lower than the value seen in 
the SWIRE fields. This variation is also seen when stacking radio 
sources using the 70- c atalogues of each fiel d. Through com- 
parison with the results of App leton et alJ d2004l) . who look at in- 
dividual sources which are detected in the xFLS field, we conclude 
that our radio flux density measurements are not being significantly 
biased by the stacking procedure. 

We have considered several potential explanations for the dif- 
ference in the median values of q24, and shown that effects such 
as a flux calibration error cannot be responsible for the difference 
in results. While we can not explain the systematic offset which is 
being seen, we believe that it originates in some combination of a 
'stacking bias' and the varying noise levels of the different radio 
surveys. The systematic effect leads to an uncertainty in the abso- 
lute values of 524 an d qjo, but not in their variation with infrared 
flux density. 

We find no evidence for a variation in the median value of 
<724 from any of the survey fields down to a 24 -iim flux density o f 
150 iijy. This is in agreement with the results o flBovle et alJt2 007). 
althou gh is in contrast to the tentative findings of iBeswick et al.l 
d2008l) . We find a similar result at 70 \im, although over the brighter 
flux density range of 10 - 100 mjy. We conclude that there is no evi- 
dence for any significant variation in the infrared / radio correlation 
with infrared flux density, down to the limits set by the extremely 
sensitive Spitzer observations of the xFLS and SWIRE fields. 
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